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ABSTRACT 

The discovery of hydroxymethyl-, formyl- and 
carboxylcytosine, generated through oxidation of 
methylcytosine by TET dioxygenases, raised the 
question how these modifications contribute to epi- 
genetic regulation. As they are subjected to complex 
regulation in vivo, we dissected links to gene ex- 
pression with in vitro modified reporter constructs. 
We used an Oct4 promoter-driven reporter gene and 
demonstrated that in vitro methylation causes gene 
silencing while subsequent oxidation with purified 
catalytic domain of TET1 leads to gene reactiva- 
tion. To identify proteins involved in this pathway 
we screened for TET interacting factors and identi- 
fied TDG, PARP1, XRCC1 and LIG3 that are involved 
in base-excision repair. Knockout and rescue ex- 
periments demonstrated that gene reactivation de- 
pended on the glycosylase TDG, but not MBD4, while 
NEIL1 , 2 and 3 could partially rescue the loss of TDG. 
These results clearly show that oxidation of methyl- 
cytosine by TET dioxygenases and subsequent re- 
moval by TDG or NEIL glycosylases and the BER 
pathway results in reactivation of epigenetically si- 
lenced genes. 



INTRODUCTION 

DNA methylation at the C5-position of cytosine plays an 
essential role in a variety of fundamental processes, such 
as early embryonic development, X-chromosome inactiva- 
tion, genome stability and imprinting (1,2). In vertebrates, 
this epigenetic modification is set by the three DNA methyl- 
transferases DNMTl, DNMT3A and DNMT3B and the 
regulatory subunit DNMT3L (3-5). 

Recently, it was discovered that the TET family of Fe(II)- 
and 2-oxoglutarate-dependent dioxygenases can succes- 
sively convert 5-methylcytosine to 5-hydroxymethylcytosine 
(hmC), 5-formylcytosine (fC) and 5-carboxylcytosine (caC) 
in vitro and in vivo (6-8). Three different TET proteins 



(TETl, TET2 and TET3), each showing tissue-specific dif- 
ferential expression (9), have been identified in mouse and 
human (10). Functional studies indicate that they are in- 
volved in a variety of cellular processes including epigenetic 
reprogramming, differentiation, myelopoiesis and imprint- 
ing (11-13). Mutations of TET2 correlating with lower hmC 
levels and altered gene expression patterns have been linked 
to various hematopoietic malignancies (14,15). 

The discovery of TET proteins and their catalytic prod- 
ucts hmC, fC and caC has raised the question about the 
functions of these oxidized cytosine variants. They might 
serve as independent epigenetic signals and have been 
shown to recruit a distinct and dynamic set of 'reader' pro- 
teins in embryonic stem cells (ESCs) and differentiated cells 

(16) . It also has been described that cytosine oxidation af- 
fects the efficiency of transcription by RNA polymerase II 

(17) . However, the low abundance of fC and caC suggests 
that these cytosine variants are quickly processed in vivo 
and have been proposed to be intermediates in active DNA 
demethylation (18,19). 

Whereas the mechanism of setting the methylation mark 
is well understood, the process of its removal has long been 
elusive. DNA demethylation may either occur by a passive 
process via the inhibition of DNMTl maintenance methy- 
lation after replication (20-22) or by an active enzymatic 
reaction. In principle, there are three possibilities: first, the 
direct removal of the methyl group, second, the excision of 
either the methylated cytosine or third, of the entire nu- 
cleotide. It is currently proposed that the additional oxi- 
dized cytosine derivatives hmC, fC and caC are intermedi- 
ates in active DNA demethylation, thereby contributing to 
epigenetic plasticity and transcriptional regulation (23,24). 

Several biochemical studies revealed that thymine DNA 
glycosylase (TDG) can specifically bind to and excise fC 
and caC, resulting in abasic sites, which might be subse- 
quently processed by the base-excision repair (BER) ma- 
chinery (18,25). In general, the BER pathway repairs dam- 
aged DNA sites through recognition and excision of base le- 
sions by substrate-specific glycosylases. The generated aba- 
sic site is subsequently cleaved by the AP endonuclease 1 
(APEXl), leading to a single-strand break, which is recog- 
nized by PARPl through its N-terminal zinc fingers. PARPl 
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then recruits XRCCl, LIG3 and DNA polymerase beta 
to complete the BER reactions (26-28). TDG depletion in 
mice causes embryonic lethality, and TDG deficient ESCs 
display prominent alterations of CpG modifications at a 
large number of gene regulatory regions (29,30). 

Another discussed alternative for DNA demethylation is 
based on the initial deamination of hmC to hydroxymethy- 
luracil (hmU) by members of the AID/APOBEC cytidine 
deaminase family (23). In the following step, hmU might 
be excised either by TDG, methyl-CpG-binding domain 
protein 4 (MBD4) or the single-strand-specific monofunc- 
tional uracil-DNA glycosylase 1 (SMUGl) (31-33). How- 
ever, there is evidence that AID/APOBEC members are 
less active on modified cytosines in vitro or in vivo, chal- 
lenging the prominence of the proposed deamination-linked 
demethylation pathway in living cells (34). Furthermore, a 
direct decarboxylation of caC to unmodified cytosine has 
been detected in ESC lysates, but no specific decarboxylase 
has been identified so far (35). 

In addition to TDG, two members of the NEIL family of 
glycosylases (NEILl and NEIL3) have recently been iden- 
tified as potential binders for oxidized cytosine derivatives 
(16). However, their function in TET-dependent demethy- 
lation has not been investigated to date. 

To unravel the effects of TET-mediated cytosine oxida- 
tion on gene expression, we generated in vitro modified 
pOct4-reporter plasmids and monitored their in vivo expres- 
sion in ESCs. Whereas methylation of the reporter DNA 
leads to silencing of gene expression, subsequent oxidation 
results in gene reactivation. We show that TET proteins in- 
teract with BER factors in vivo and propose that the ob- 
served oxidation-dependent gene reactivation requires the 
BER machinery. We demonstrate that initiation of this 
pathway is mainly dependent on TDG activity, but not 
on MBD4. Our results also indicate that the glycosylases 
NEILl, NEIL2 and NEIL3 can contribute to an alterna- 
tive BER pathway for DN A demethylation and cause gene 
reactivation. 

MATERIALS AND METHODS 

Cell culture and transfection 

Human embryonic kidney 293T (HEK293T) and baby 
hamster kidney (BHK) cells containing a stably integrated 
lac operator array (36) were cultured in Dulbecco's modi- 
fied Eagle s medium (Sigma) supplemented with 10% Fetal 
Calf Serum (FCS) (Biochrom) and 50 jJLg/|xl gentamycine 
(PAA). HEK293T and BHK cells were transiently trans- 
fected using polyethylenimine pH 7.0 (Sigma) according to 
the manufacturer's instructions. 

Mouse wild-type (wt) E14 as well as Tdg-/- and Mbd4-/- 
ESCs (29,37) were cultured on gelatin coated flasks or op- 
tical 96-well plates (Greiner) using 1000 U/ml LIE, l|xM 
PD032591 and 3 jjlM CHIR99021 (Axon Medchem, (38)). 
ESCs were transfected with Lipofectamine 2000 (Invitro- 
gen) according to the manufacturer's instructions. 

In vitro methylation and oxidation of plasmid DNA 

In vitro methylation of pOct4-GFP plasmid DNA was per- 
formed using M.SssI methyltransferase (New England Bi- 



olabs) according to the manufacturer's instructions. The 
methylation status of the plasmid was tested by Hpall and 
Mspl (Fermentas) digestion. 

For the in vitro oxidation, GFP-TETICD or GFP- 
TETICD"""' (H1652Y, D1654A) was purified from mam- 
malian cells. In detail, HEK293T cells were transfected with 
an expression construct for GFP-TETICD/TETICD™' 
and immunoprecipitation was carried out using GBP-Ni- 
NTA beads. Proteins were eluted using imidazole. The 
in vitro methylated plasmid was diluted in TET reaction 
buffer (50 mM HEPES pH 8.0, 75 |jlM Fe(II), 2 mM 
Sodium-Ascorbate, 1 mM Di-Sodium-Ketoglutarate (39)) 
and added to the purified GFP-TETICD. 

Digestion of hydroxy methylated plasmid with PvuRtslI 

A total of 200 ng oxidized plasmid DNA and 100 ng of 
reference DNA fragments containing exclusively unmodi- 
fied C, mC or hmC were digested with PvuRtslI (150 mM 
NaCl, 20 mM Tris pH 8.0, 5 mM MgCla, 1 mM DTT) at 
22°C for 20 min (40). The reaction was inactivated at 65°C 
for 10 min and digestion of the samples was analyzed by 
agarose -gelelectrophoresis. 

Co-immunoprecipitation (Co-IP) using the GFP-Trap 

Note that 36 h after transfection, whole cell lysates of 
HEK293T cells were prepared using RIPA-lysis buffer (50 
mM Tris pH 7.0, 150 mM NaCl, 0.1% sodium dodecyl sul- 
phate (SDS), 0,5% sodium deoxycholate, 5 mM ethylene- 
diaminetetraacetic acid (EDTA), 2.5 mM MgCh, 0.5 mM 
CaCl2, 2 mM PMSF, Ix Mammalian Protease Inhibitor 
Cocktail and 1 ixg/jjil DNasel). After centrifugation, 10% 
of the supernatant was collected as input fraction and the 
remaining supernatant was diluted in IP-buffer (10 mM 
Tris/Cl pH 7.5, 150 mM NaCl and 0.5 mM EDTA) to 800 
|jl1. Green Fluorescent Protein (GFP)-Trap beads (Chro- 
motek, (41)) pre-equlibrated with IP-buffer were added to 
the supernatant dilution and rotated for 1.5 h at 4°C. The 
GFP -beads were centrifuged and 10% of the supernatant 
was collected as flowthrough fraction. For western blot- 
ting, input, flowthrough and bead fractions were boiled 
with Laemmh buffer at 95°C for 10 min, loaded on an SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) and trans- 
ferred to a nitrocellulose membrane (Biorad). Immunode- 
tection was performed using mouse monoclonal anti-GFP 
(Roche, 11814460001) or rat monoclonal anti-RFP anti- 
bodies (42) and Alexa488/Alexa594 coupled secondary an- 
tibodies (Jackson ImmunoResearch). 

For mass spectrometry analysis, protein samples were de- 
natured with GdnHCl, precipitated with acetone and di- 
gested with trypsin. Peptide mixtures were analyzed us- 
ing electrospray tandem mass spectrometry in collaboration 
with the Core Facility of the Max-Planck-Institute for Bio- 
chemistry, Martinsried. Experiments were performed with 
an LTQ Orbitrap mass spectrometer (Thermo Scientific). 
Spectra were analyzed with MaxQuant (43). 

Fluorescent-three-hybrid assay (F3H) 

Transgenic BHK cells containing stably integrated lac- 
operator repeats (36) were grown to 60-70% confluence 
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on coverslips. The cells were transiently cotransfected 
with expression constructs encoding for LacI-GBP, murine 
RFP/mCherry- and GFP-fusion proteins (44). As controls 
the catalytically inactive mutants gFP-TET1"'*'52y&di654A^ 

GFP-TET2H1304Y&D1306A and GFP-TET3™50Y&D952A ^g^g 

used. Note that 24 h after transfection cells were fixed with 
4% formaldehyde in phosphate buffered saline (PBS), per- 
meabilized with 0.5% Triton X-IOO/PBST, counterstained 
with 4',6-diamidino-2-phenylindole (DAPI) and mounted 
in Vectashield (Vector Laboratories). Imaging was per- 
formed using a Leica TCS SP5 confocal laser scanning mi- 
croscope with a 63x/1.4 NA Plan-Apochromat oil immer- 
sion objective. 

The Operetta automated imaging system (PerkinElmer) 
was used for F3H quantification (Harmony 3.6 software). 
After imaging, nuclei were detected based on DAPI signal. 
The lacO-spot was defined in the GFP channel and screened 
for enrichment at the RFP channel (intensity spot >1.2x 
mean intensity nucleus; see also Supplementary Figure S4). 

High-throughput pOct4-reporter gene expression analysis 

Wild type, Tdg-/- and Mbd4-/- ESCs were transiently trans- 
fected with unmodified, M.SssI methylated or in vitro oxi- 
dized pOct4-GFP plasmid DNA on coated optical 96-well 
plates (PerkinElmer). Note that 24 h after transfection, the 
cells were fixed with 4% formaldehyde/PBS, permeabilized 
with 0.5% Triton X-IOO/PBST and counterstained with 
DAPI. 

Images were acquired using the Operetta automated 
imaging system with an 40x high NA objective and 
expression was quantified using Harmony 3.6 software 
(PerkinElmer). A total of 16 fields per well were imaged, 
cells were counted and segmented into nuclei and cytoplasm 
on the basis of DAPI and reporter mCherry/GFP signal. 
Prism software (GraphPad) was used for statistical analy- 
sis. 

Generation of stable transgenic cell lines 

Tdg-/- ESCs stably expressing GFP-fusions of wt TDG, 
TDgni5ia^ TDG^ie™ and TDGI^^soh ^^^^ generated by 

transfecting the respective plasmids in the presence of the 
selection marker blasticidine followed by repeated sort- 
ing for GFP expression with the fluorescence-activated cell 
sorting (FACS) Ariall (Becton-Dickinson) system. Single 
cell sorting was used to generate clonal transgenic cell lines. 
GFP-expression of the single cell clones was analyzed using 
the Operetta system or western blotting. 

Activity of GFP-TDG in vitro 

GFP-TDG and the different mutants were expressed 
in HEK293T cells and immunoprecipitated as described 
above. Equal amounts of GFP-tagged protein immobilized 
on GFP-Trap beads were incubated with 0.4 jjlM of DNA 
substrate in TDG reaction buffer (20 mM TrisHCl, pH 
8.0 or pH 6.5, 100 mM NaCl, 1 mM MgCh, 0.2 mM 
EDTA, 1 mM DTT, 0.01 mM ZnCl2). In detail, these 
DNA substrates were fluorescently (ATTO550) labeled 42 
bp oligonucleotides (GGA TGA TGA CTC TTC TGG 



TCC GGA TGG TAG TTA AGT GTT GAG) (Eurofins 
MWG Operon) with a central modified CpG site: either 
hmCpG, fCpG or caCpG or harboring a GoT mismatch at 
this site. Incubation was performed in the presence of pu- 
rified GFP- APEX 1 for 2 h at room temperature. Follow- 
ing heat-inactivation of TDG (2 min, 95°C), fresh GFP- 
APEXl was added and further incubated for 4 h at room 
temperature. An oligonucleotide containing a deoxyabasic 
site ('dSpacer', Eurofins MWG Operon) served as a pos- 
itive control for APEXl activity. DNA was analyzed on 
a denaturing 17% polyacrylamide gel with the Typhoon 
TRIO (GE Healthcare Life Sciences). Quantification was 
performed with Image! 

Genomic DNA extraction and slot blot analysis 

Genomic DNA from ESCs was extracted using the Blood & 
Cell Culture Midi Kit (Qiagen) according to the manufac- 
turer's instruction. Anti-oxidant BHT (200 |jlM, Sigma) and 
deaminase inhibitor THU (200 |xM, Sigma) were added to 
the lysis buffer and elution buffer. The Bio-Rad slot blot sys- 
tem was used according to the manufacturer's instruction. 
Nitrocellulose membranes (Amersham) were crosslinked, 
blocked with 5% milk and immunostaining was performed 
using a mouse monoclonal antibody against mC (Eurogen- 
tec, 33D3) or rabbit polyclonal antibodies against hmC, fC 
and caC (Active motif: 39791, 61233, 61224). Alexa488- 
coupled secondary antibodies were used for detection and 
the membranes were scanned with the Typhoon TRIO (GE 
Healthcare Life Sciences). Quantification was performed 
with Image! 

Re-isolation of transfected plasmids 

Note that 36 h after transfection, nuclei were extracted from 
the ESCs using the Blood & Cell Culture Midi Kit (Qia- 
gen) according to the manufacturer's instructions. Plasmid 
DNA was re-isolated using the Qiaprep Plasmid Midi Kit 
(Qiagen). A total of 200 ng of re-isolated plasmid DNA was 
digested with 0.5 |jl1 Hpall (Fermentas). 

RESULTS 

In vitro oxidation of Oct4 reporter plasmid DNA causes gene 
activation 

Since the discovery of hmC, fC and caC, two major roles 
for these cytosine modifications have been proposed: first, 
as intermediates in active DNA demethylation, and second, 
as independent epigenetic marks. The latter has been inves- 
tigated by identifying reader proteins for hmC, fC and caC 
in different murine tissues. These new DNA modifications 
recruit a dynamic set of readers including DNA repair fac- 
tors and chromatin remodelers (16). TET-dependent cyto- 
sine oxidation has been shown to occur at a large number 
of gene regulatory elements and repetitive loci (30). 

Here, we focus on the effects of hmC, fC and caC on 
gene expression. We transfected ESCs with Oct4 promoter- 
driven GFP and mCherry reporter plasmids (pOct4-GFP 
or pOct4-mCherry) in different modification states and an- 
alyzed expression of the reporter by quantitative imag- 
ing. To generate the different cytosine modifications in 



Nucleic Acids Research, 2014, Vol. 42, No. 13 8595 



vitro, the unmodified pOct4-repoiter construct (CpG) was 
initially treated with the DNA-methyltransferase M.SssI, 
thereby creating fully methylated CpG sites (™CpG), and 
subsequently incubated with the purified catalytic domain 
of TETl (TETICD) to create oxidized cytosine residues 
(""CpG; Figure la). Specific restriction enzymes were used 
to monitor the methylation and oxidation state of the plas- 
mid DNA. The in vitro methylation of the reporter con- 
struct was confirmed by the mC-sensitive restriction en- 
donuclease Hpall and the mC-insensitive enzyme Mspl. 
Both Mspl and Hpall fully cleave the unmethylated pOct4- 
reporter plasmid at CCGG sites, whereas the restriction pat- 
tern of the M.SssI-methylated reporter only appeared with 
Mspl digestion (Figure lb). 

To follow the oxidation of mC to hmC by TETICD, the 
reporter DNA was treated with the hmC-specific endonu- 
clease PvuRtslI (40) (Supplementary Figure SI a). Treat- 
ment with TETICD resulted in a gradual increase of hmC 
levels after 15, 45 and 90 min of incubation, visible as pro- 
gressing fragmentation (Figure lb). While using a catalyti- 
cally inactive TETICD mutant (TETICD"""') as a control, 
no hmC levels were detected (°''*CpG), indicating specific 
enzymatic mC oxidation by TETICD (Supplementary Fig- 
ure Sib). 

Consistent with the restriction digest results, methylation 
and oxidation of the reporter plasmid could also be shown 
by slot blot analysis (Figure Ic). Incubation of methy- 
lated plasmid DNA with TETICD resulted in an increase 
of not only hmC but also of fC and caC, demonstrating 
that purified TETICD did carry out the three oxidation 
steps in vitro (Figure Ic and d). The mC signal decreased 
over time as hmC, fC and caC appeared in the presence 
of active TETICD, while remaining constantly high with 
TETICD™' (Supplementary Figure Sid). 

Transfection of ESCs with the TETl CD-treated plasmid 
DN A allows to directly monitor the effect of the oxidized 
cytosines on gene expression, independent from in vivo TET 
activity. Reporter gene expression from either unmodified, 
in vitro methylated or oxidized pOct4-mCherry was visual- 
ized using confocal imaging or automated image acquisi- 
tion for quantification. Transfection of ESCs with unmod- 
ified pOct4-mCherry resulted in a strong nuclear and cy- 
toplasmic expression of the reporter, whereas expression 
drastically decreased when using the methylated construct. 
Interestingly, prominent reporter expression could be ob- 
served upon transfection of the oxidized plasmid DNA, but 
not with the '"'*CpG reporter DNA (Figure le). This sug- 
gests that reactivation of gene expression requires oxidation 
of methylcytosine by TET proteins. 

TET proteins interact with the BER machinery 

Currently, three pathways for TET-mediated active DNA 
demethylation are discussed: TDG-dependent BER, 
deamination-dependent BER and direct decarboxylation 
of caC (18,23,35). Since the proteins responsible for gene 
reactivation in our assay might be physical interaction 
partners of the TET proteins, we performed an initial 
unbiased screen for interactors. We expressed GFP-TETl 
in HEK293T cells, performed immunoprecipitation and 
analyzed the co-precipitated proteins by mass spectrome- 



try Prominently, we found PARPl, XRCCl and LIG3, a 
subset of proteins involved in the BER pathway (Supple- 
mentary Figure S2a). These results point toward the two 
BER-dependent demethylation mechanisms. 

To further investigate the interplay between TET pro- 
teins and BER, we systematically performed Co-IP and a 
recently established F3H assay of all three TET proteins 
with the following BER factors: TDG, MBD4, SMUGl, 
NEILl, NEIL2, NEIL3, PARPl, LIG3 and XRCCl. 
F3H allows to directly visualize protein-protein interac- 
tions in living cells (44). We therefore co-expressed full- 
length GFP-TET fusion proteins together with potential 
mCherry/RFP-tagged interactors (Supplementary Figure 
S3a) and LacI-GBP in BHK cells containing a stably inte- 
grated lac-operator array (45). The GFP-tagged bait is en- 
riched at the lac-operator array via LacI-GBP and is visible 
as a single spot inside the nucleus. If the mCherry/RFP- 
tagged prey protein interacts with the bait, it colocalizes at 
the same spot (Figure 2a). 

Consistent with the mass spectrometry results, inter- 
actions of TETl, TET2 and TET3 were observed with 
PARPl, LIG3 and XRCCl both in Co-IP and F3H. Inter- 
estingly, several glycosylases also showed a clear interaction 
in both assays, among them TDG, MBD4, NEILl, NEIL2 
and NEIL3, but not SMUGl (Figure 2b and c; Supplemen- 
tary Figures S2b and S3b, c). Automated high-throughput 
image analysis was used to quantify the F3H results (Sup- 
plementary Figure S4). The interaction of all three TET 
proteins with TDG was the most robust and detectable in 
more than 80% of all analyzed cells. For the other factors, 
numbers vary between 40% and 75% (Figure 2d). 

To exclude that the recruitment to the lacO-spot is de- 
pendent on the locally enriched cytosine oxidation products 
generated by TET proteins, we repeated the F3H quantifi- 
cation with catalytically inactive TET mutants (Figure 2e). 
The percentage of cells showing an interaction did hardly 
differ compared to the assay with active TET proteins. The 
only exception is XRCCl where only half as many cells dis- 
played a colocalization at the spot, suggesting a potential 
cooperative effect. 

Taken together, these results suggest that TET proteins 
physically interact with the BER machinery and are there- 
fore able to recruit these factors to the site of cytosine oxi- 
dation for immediate removal of the modified base in vivo. 

TDG but not MBD4 mediates oxidation-dependent gene re- 
activation in ESCs 

Since DNA glycosylases catalyze the first step of the BER 
pathway and are therefore the initiators of TET-dependent 
cytosine demethylation, we investigated their role in gene 
reactivation with the reporter gene assay. Besides TDG, 
also MBD4 has been implicated in DNA demethylation via 
excision of hmU, the deamination product of hmC (46). 
To investigate the role of these two glycosylases, we trans- 
fected Tdg-/- and Mbd4-/- ESCs (29,37) with either un- 
modified, in vitro methylated or oxidized pOct4-mCherry 
plasmid DNA. In contrast to wt E14 ESCs, Tdg-/- ESCs 
showed no reporter gene expression from the oxidized plas- 
mid. However, Mbd4 knockouts were able to fully reacti- 
vate gene expression from the oxidized reporter construct. 
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Figure 1. In vitro oxidation of mC causes gene reactivation in ESCs. (a) Schematic representation of in vitro reporter DNA modification: Unmethylated 
pOct4-reporter DNA was methylated using the CpG methykransferase M.SssI. Incubation with purified TETICD results in oxidation of mC sites to hmC, 
fC and caC. (b) M.SssI treatment of pOct4-mCherry results in full methylation as shown after restriction with the methylation sensitive enzyme Hpall. 
Mspl cuts irrespective of the methylation state. The hmC-specific restriction endonuclease PvuRtsll detects increasing hmC levels during incubation of 
methylated pOct4-mCherry with TETICD. (c) Cytosine modification states of untreated, methylated and TETICD oxidized pOct4-mCherry plasmid 
DNA were detected by slot blot. A 2-fold serial dilution of the plasmid DNA was loaded and detected using antibodies against mC, hmC, fC and caC. 
A gradual increase of hmC, fC and caC signals was obtained with longer incubation time with TETICD while the mC signal decreases accordingly, (d) 
Quantification of the slot blot signals of pOct4-mCherry after treatment with TETICD shows increasing oxidation of mC to hmC, fC and caC. The sum 
of all CpG modification signals was set to 100%. Error bars indicate standard deviation {n = 3). (e) ESCs were transfected with pOct4-mCherry plasmids 
containing either unmodified (CpG), methylated (mCpG), TETl CD-oxidized (""CpG) or TETlCD™"'-treated (°''*CpG) cytosines. Confocal imaging and 
quantification show reporter gene silencing upon methylation and reactivation upon oxidation. Cells were fixed with formaldehyde and counterstained 
with DAPI. Scale bar: 5 jjim. (Right: n = 200 000; error bars indicate standard deviation). 
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Figure 2. All three TET proteins interact with a variety of BER factors, (a) Scheme depicting the F3H assay for in vivo visualization of protein interactions: 
BHK cells containing a stably integrated lac-operator array were transfected with plasmids expressing a lac-repressor-GBP fusion protein, GFP-BAIT 
and mCherry/RFP-PREY. The Lac-repressor binds to the lac-operator array and recruits the GFP-BAIT through GBP. Proteins interacting with the 
BAIT are consequently enriched at the lac-operator array, (b) Co-IP with subsequent SDS-PAGE and western blotting shows interaction of the three TET 
proteins with TDG, PARPl, MBD4 but not SMUGl. GFP-tagged TET proteins and the respective mCherry-fusions were expressed in HEK293T cells 
and iminunoprecipitated with the GFP-Trap (I: Input; F: Flowthrough; B: Bound), (c) F3H was used to confirm TETl interactions with different factors 
involved in BER. GFP-TETl is enriched at the lac-operator array and mCherry-tagged interacting factors are recruited to the same spot (solid triangle). 
SMUGl shows no co-localization at the lacO-spot (empty triangle). Scale bar: 5 ]xm. (d) Quantification of the F3H assay of all three TET proteins with 
the indicated BER factors. Bars represent percentage of cells that show colocalization at the lacO-spot (n>200). (e) As in (d) but with catalytically inactive 
TET mutants. 
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The rescue experiment with wt GFP-TDG re-established 
the ability of Tdg-/- to express mCherry from the '"'CpG 
plasmid, whereas the transient rescue of Mbd4-/- cells with 
GFP-MBD4 led to no significant differences with regard to 
reporter gene expression (Figure 3a). 

High-throughput image analysis of 200 000 cells revealed 
that the signal from the oxidized plasmid in wt E14 ESCs is 
about 80% of the signal from unmodified reporter DNA. In 
Tdg- 1- cells, the ""CpG signal drops almost to the level of 
the fully methylated reporter. Thus, the ""CpG construct re- 
mains silent in Wg-/- ESCs. The stable rescue with wt GFP- 
TDG led to a recovery of mCherry expression to about 70% 
of the signal from the unmodified reporter. Knockout of 
Mhd4 and also the corresponding rescue did not alter ex- 
pression levels compared to wt E14 ESCs (Figure 3b), indi- 
cating that oxidation-dependent gene reactivation requires 
TDG but not MBD4 expression. 

To gain insight into the mechanism by which TDG medi- 
ates gene activation, we recovered the transfected reporter 
plasmid DNA from wt E14 ESCs, Tdg-j- and Mbd4-/- 
ESCs. We analyzed the modification status by digestion 
with Hpall, which specifically cuts unmodified cytosines in 
a CCGG context (Supplementary Figure Sic). We observed 
a reappearance of the Hpall restriction pattern on the 
""CpG plasmids isolated from wt E14 and MhcU-j- ESCs 
indicating that TET-dependent demethylation occurred in 
vivo. Hpall only displayed minor activity on ""CpG re- 
porter DNA re-isolated from Tdg-/- cells (Figure 3c). This 
provides strong evidence that the substitution of oxidized 
cytosine by unmodified cytosine is the major mechanism for 
the observed gene reactivation and that this substitution de- 
pends on TDG. 

TDG activity is required for reporter gene reactivation 

To investigate whether the glycosylase activity of TDG is re- 
sponsible for the observed gene reactivation, we generated 
Tdg-/- rescue cell lines, which stably express GFP-fusions 
of either wt, catalytically inactive (N151A), DNA binding 
deficient (M280H) or caC-specific (N168D) TDG (31,47) at 
equal levels (Supplementary Figure S5a). The murine caC- 
specific mutant corresponds to a published human TDG 
mutant (48). Expression of the oxidized mCherry-reporter 
plasmid in the stable rescue with wt GFP-TDG increased 
almost to the levels of wt E14 ESCs in line with the results 
from the transient rescues. In contrast, the catalytically in- 
active mutant (N 1 5 1 A) or DNA binding deficient (M280H) 
TDG was not able to recover reporter expression. Cells sta- 
bly expressing the caC-specific TDG (N168D) were only ca- 
pable of partially restoring reporter expression (Figure 4a 
and b; Supplementary Figure S5b). This demonstrates that 
base excision by active TDG is essential for gene reactiva- 
tion. 

To further characterize the activity of wt TDG as well 
as the TDG mutants, we established an in vitro assay based 
on a defined DNA substrate with a single modification site. 
Base excision by TDG generates an abasic site, which can 
specifically be converted into a single-strand break by puri- 
fied APEXl and can be detected on a denaturing gel. Thus, 
this assay mimics the first two steps of the BER reactions 
and is also appHcable on fluorescently labeled DNA sub- 



strates, in contrast to the previously described 'nicking as- 
say', in which alkaline treatment and subsequent boiling is 
used to create single-strand breaks (49,50). 

TDG has long been known to repair GoT mismatches 
(51) and was recently found to excise fC and caC (18,25). 
We could show that wt TDG is more active on fC and 
caC than on GoT mismatches, its eponymous substrate. No 
activity was detected on hmC-containing DNA. Although 
the caC-specific TDG mutant (N168D) was able to par- 
tially reactivate reporter gene expression in vivo, we could 
only detect basal activity on the caC substrate in vitro (Fig- 
ure 4c, Supplementary Figure S5d). Since the activity of 
TDG in vitro is pH-dependent (48), we repeated the as- 
say at pH = 6.5. Under these conditions, the preference of 
the caC-specific TDG mutant (N168D) toward caC could 
be confirmed (Supplementary Figure S5d). Taken together, 
these data suggest that the excision of fC and caC by TDG 
is essential for TET-mediated demethylation and causes re- 
activation of gene expression. 

NEILl, NEIL2 and NEIL3 glycosylases can partially com- 
pensate for loss of TDG 

Besides TDG, we identified the family of NEIL glycosylases 
as interactors of TET proteins (Figure 2c). Interestingly, 
NEILl and NEIL3 have also been described as binders of 
hmC, fC or caC cytosines in ESCs in a proteome wide screen 
(16). However, their function in this context has not been in- 
vestigated so far. 

To elucidate whether the NEIL glycosylase family con- 
tributes to gene reactivation, we measured the expression 
of the modified reporter plasmids in Tdg-/- cells transiently 
overexpressing NEILl, NEIL2 or NEIL3 at similar lev- 
els (Figure 5a). Interestingly, we observed a significant in- 
crease of pOct4-GFP expression in Tdg-/- ESCs rescued 
with mCherry-NEILl, NEIL2 or NEIL3 in comparison to 
Tdg-/- ESCs. However, lower expression levels as in rescues 
with wt TDG were detected. RFP-MBD4 was not able to 
rescue the Tdg-/- phenotype (Figure 5b). 

Additionally, we isolated genomic DNA from wt E14 and 
Tdg-/- ESCs as well as from the transient rescues with wt 
TDG and NEILl, 2 and 3. Slot blot analyses were carried 
out for relative hmC, fC and caC quantifications. Genomic 
hmC was present at comparable levels in all tested cell lines 
and was not affected by Tdg knockout or NEIL overexpres- 
sion. Since TDG is able to recognize and excise fC and caC, 
accumulation of these oxidized bases was observed in Tdg- 
/- ESCs, consistent with previous reports (30,52). Rescues 
of Tdg-/- cells with transiently expressed wt TDG, NEILl, 
NEIL2 or NEIL3 resulted in decreased genomic fC, and 
for wt TDG and NEILl, also in decreased caC levels (Fig- 
ure 5c). These findings support the role of the glycosylases 
TDG, NEILl, NEIL2 and NEIL3 in active DNA demethy- 
lation and subsequent reactivation of gene expression via 
excision of fC and caC followed by BER (Figure 6). 

DISCUSSION 

In this study, we investigated the effects of the oxidized 
cytosine variants hmC, fC and caC on gene expression. 
By carrying out the enzymatic oxidation of a methylated 
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Figure 3. Oxidation of ™CpG plasmid DNA leads to TDG-dependent gene reactivation, (a) Tdg-/- ESCs were transfected witli pOct4-mCherry plas- 
mids containing either unmodified, metliylated or oxidized CpGs. Confocal images show a defect of ""CpG gene reactivation in Tdg-/- ESCs but not in 
Mhd4-/- ESCs. Transient rescue of Tdg-/- ESCs with GFP-TDG re-establishes ™CpG reporter gene expression. Cells were fixed with formaldehyde and 
counterstained with DAPI. Scale bar: 5 \Lm. (b) High-throughput image acquisition and quantification of pOct4-mCherry expression shows that oxidation 
of ™CpG sites in the pOct4-reporter results in reactivation of mCherry-expression in wt E14 ESCs and Mhd4-/- ESCs, but not in Tdg-/- ESCs. Expression 
of GFP-TDG rescues the phenotype (student's i-test, **P < 0.025, n = 200 000; error bars indicate standard deviation), (c) Analytical digest with Hpall 
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pOct4-reporter construct in vitro, we separated the gener- 
ation of modified cytosines from their further processing 
in vivo. This allowed us to directly investigate the cellu- 
lar factors responsible for gene reactivation independent of 
endogenous TET activity and regulation. In wt ESCs, we 
observed strong reporter expression from oxidized but not 
from methylated plasmids, suggesting oxidation-dependent 
gene reactivation. 

To investigate which pathway is responsible for the ob- 
served gene reactivation, we searched for potential TET in- 
teraction partners. So far, it has been shown that MBD3 



colocalizes with TETl regulating hmC-marked gene ex- 
pression (53) and that TETl, TET2 and TET3 interact 
with OGT controlling protein stability, localization and his- 
tone modification (54-57). Also, several chromatin-binding 
factors, such as HDACl, EZH2 and MeCP2, have been 
described to associate with TETl (58). However, none 
of these factors is likely to be involved in the process 
of DNA demethylation. Therefore, we performed a mass 
spectrometry-based pull-down approach in which several 
BER factors co-precipitated with TETl. To confirm these 
results, we used Co-IP and a recently described F3H as- 
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transient Tdg-/- rescues 




Figure 5. The NEIL glycosylase family can partially compensate for TDG. (a) Quantification of mCherry intensities with high- throughput imaging shows 
that transient Tdg-/- rescue ESCs express mCherry-tagged TDG, NEILl/2/3 and MBD4 glycosylases at comparable levels (n = 100 000; error bars 
indicate standard deviation), (b) The ability of the NEIL family of glycosylases to substitute TDG in vivo was monitored by expression of differentially 
modified pOct4-GFP. With ectopic expression of mCherry-NEILl, NEIL2 and NEIL3, reporter gene signal was significantly higher than in Tdg-/- cells, 
although not reaching the levels of the wt mCherry-TDG rescue. MBD4 overexpression could not compensate for loss of TDG (student's /-test, *P < 0.05, 
**/'<0.025, n = 200 000; error bars indicate standard deviation), (c) Slot blot analysis of genomic DNA isolated from wt E14, Tdg-/- and the indicated 
rescues shows constant levels of hmC and accumulation of fC and caC in Tdg-/- cells. Overexpression of wt TDG or NEILl, 2 or 3 leads to a decrease in 
genomic fC amounts but does not reach wt E14 levels. Expression of wt TDG or NEILl also reduces caC accumulation. 



say to test interaction of all three TETs with different BER 
factors in vitro and in vivo. We were able to detect inter- 
actions of TET proteins with the DNA glycosylases TDG, 
MBD4 as well as NEILl, NEIL2 and NEIL3, which excise 
damaged or oxidized DNA bases (18,25,32,59,60). Further- 
more, interactions of all three TET proteins were observed 
with PARPl, which detects single-strand breaks and mod- 
ifies repair factors by PolyADP-ribosylation (61). Finally, 
we showed TET interactions with LIG3 and XRCC 1 , which 
are recruited by PARPl to the site of DNA damage and lig- 
ate the DNA strand after the insertion of cytosine (62-64). 
The observed interactions were largely independent of TET 
activity, indicating direct protein-protein interactions. 

Our findings suggest that both TET-dependent oxidation 
of mC and subsequent excision of oxidized cytosines by the 



BER machinery take place in one large protein complex in 
a spatially and temporally coordinated manner. This close 
association enables highly efficient replacement of oxidized 
cytosines. In accordance with these results, fC and caC, in 
contrast to hmC, are detected at very low genomic levels and 
are proposed to be immediately removed after generation 
(18,19). 

Initial hypotheses proposed that hmC might be deami- 
nated to hmU by AID/APOBEC deaminases prior to base 
excision by DNA glycosylases. Suggested candidates were 
TDG and MBD4, which both have been shown to recog- 
nize GoT and GohmU mismatches (23,65) and have been 
identified as TET protein interactors in this study. TDG has 
also been described to be active on fC and caC (18,25). In 
contrast to wt ESCs, no gene reactivation on the oxidized 



8602 Nucleic Acids Research, 2014, Vol. 42, No. 13 



MAIN PATHWAY ALTERNATIVE 




1 



Figure 6. Two alternative pathways for TET-mediated active demethyla- 
tion. TDG is the major glycosylase that removes fC or caC, generated by 
TET proteins. Ahernatively, the NEIL glycosylases can excise oxidized cy- 
tosines, but less efficiently. Both pathways are completed by the BER ma- 
chinery. 



reporter plasmid was observed in Tdg-/- cells. Knockout of 
Mbd4, however, had no effect on reporter gene expression, 
although MBD4 interacts with TETl, TET2 and TET3. 
Apparently, this association does not contribute to gene ac- 
tivation and may be involved in a different regulatory mech- 
anism. 

Our data suggest that conversion of hmC to hmU and 
subsequent excision by MBD4 does not play a major role 
in ESCs and are in line with previous studies showing that 
AID is unable to operate on double-stranded DNA and 
no detectable deamination of hmC by AID/APOBEC in 
vitro or in vivo could be observed (34,66). However, we can- 
not rule out that AID/APOBEC is involved in the TDG- 
dependent demethylation pathway or that this pathway con- 
tributes to TET3-dependent active demethylation in zy- 
gotes (67). Restriction enzyme -based analysis of the oxi- 
dized plasmid DNA recovered from wt E14 ESCs provided 
evidence that conversion of oxidized cytosine to unmodi- 
fied cytosine led to gene activation. Again, this effect was 
dependent on TDG but not on MBD4. 

Additional in vivo experiments with different TDG mu- 
tants showed that TDG activity and not the recruitment of 
unknown factors through TDG is essential for the recov- 
ery of gene expression. The specificity of TDG toward fC 
and caC, but not hmC, was confirmed using a newly estab- 
lished assay based on the ability of APEXl to recognize 
glycosylase-generated abasic sites and convert them into 
single-strand breaks. The results also revealed that TDG ac- 
tivity is much higher on fC or caC than on a GoT mismatch, 
arguing that oxidized cytosines are the major substrate for 
TDG and that deamination is not necessary for gene reac- 
tivation. 

Besides TDG and MBD4, we also investigated the role of 
the NEIL glycosylase family in TET-mediated demethyla- 
tion. NEILl, NEIL2 and NEIL3 have been shown to excise 



several lesions resulting from DNA oxidations, such as 5- 
hydroxyuracil or thymine glycol (59,68). NEIL glycosylases 
are bifunctional, i.e. are also able to convert abasic sites into 
single-strand breaks (69). NeiB-/- mice have been reported 
to be viable as well as fertile, and the expression of NEIL3 
is elevated in hematopoietic tissues, suggesting a function in 
the immune system or hematopoiesis (70). 

Since the NEIL glycosylases have been described as po- 
tential binders of oxidized cytosines (16), we tried to com- 
pensate for the loss of TDG in Tdg-/- cells with each of 
the three NEIL proteins. Indeed, we could detect a signifi- 
cant reactivation of reporter gene expression, although not 
reaching the levels of the rescue with wt TDG. We conclude 
that the NEIL glycosylases can also initiate BER after TET- 
mediated cytosine oxidation. This hypothesis was further 
confirmed by the fact that the accumulation of genomic 
fC and caC in Tdg-/- ESCs was far less prominent when 
NEILl, 2 or 3 was overexpressed. These data clearly show 
that the NEIL proteins are not only capable of reactivating 
the oxidized reporter gene, but also of excising formylated 
and carboxylated cytosine in its chromatin context. Thus, 
the NEIL glycosylases may constitute an alternative path- 
way for active demethylation and reactivation of epigeneti- 
cally silenced genes (Figure 6). 

In summary, we show that the TET proteins interact with 
a set of factors involved in catalyzing the multiple steps of 
BER. Furthermore, we demonstrate that TDG is the main 
glycosylase in TET-mediated reactivation of the epigeneti- 
cally silenced Oct4 promoter via the BER pathway. It would 
be of interest whether the activity of the TET-BER machin- 
ery differs on other promoter types, such as CpG island con- 
taining promoters. Our results also indicate that the NEIL 
family of glycosylases can functionally replace TDG. It re- 
mains to be elucidated to which extent the NEIL glycosy- 
lases contribute to TET-mediated demethylation and gene 
reactivation and how the usage of different glycosylases is 
regulated in vivo. 
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